Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Abstract
Chinese characters are used in both Japanese and Chinese, where they are called Kanji and Hanzi, respectively. Because Chinese characters carry significant semantic information, a mapping table between Kanji and Hanzi can be very useful for many Japanese-Chinese bilingual applications, such as machine translation and cross-lingual information retrieval. Since Kanji originated in ancient China, most Kanji have corresponding characters in Hanzi. However, the relation between Kanji and Hanzi is quite complicated. In this paper, we propose a method for automatically building a Chinese character mapping table for Japanese, Traditional Chinese and Simplified Chinese from freely available resources. We define seven categories of Kanji based on the relation between Kanji and Hanzi, and classify the character mappings into these categories. We use a resource from Wiktionary to evaluate the completeness of the resulting mapping table. A statistical comparison shows that the proposed method produces a more complete mapping table than the current version of Wiktionary.
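As a rough illustration of what such a mapping table might look like, the following is a minimal Python sketch. The four entries are hand-picked, well-known character correspondences, not data from the paper. The four relation labels and the names `Mapping`, `relation` and `kanji_to_simplified` are hypothetical simplifications for illustration only; they are not the seven categories the paper defines.

```python
from typing import NamedTuple


class Mapping(NamedTuple):
    """One row of a hypothetical Kanji / Traditional / Simplified table."""
    kanji: str
    traditional: str
    simplified: str


# Hand-picked illustrative rows (assumption: not taken from the paper's table).
TABLE = [
    Mapping("山", "山", "山"),  # same form in all three
    Mapping("漢", "漢", "汉"),  # Kanji matches the Traditional form
    Mapping("国", "國", "国"),  # Kanji matches the Simplified form
    Mapping("気", "氣", "气"),  # Kanji differs from both
]


def relation(m: Mapping) -> str:
    """Label a row by which Hanzi form(s) the Kanji is identical to.

    These four labels are an illustrative simplification, not the
    paper's seven categories.
    """
    same_trad = m.kanji == m.traditional
    same_simp = m.kanji == m.simplified
    if same_trad and same_simp:
        return "same_as_both"
    if same_trad:
        return "same_as_traditional"
    if same_simp:
        return "same_as_simplified"
    return "differs_from_both"


# Index the table by Kanji for character-level lookup.
KANJI_INDEX = {m.kanji: m for m in TABLE}


def kanji_to_simplified(text: str) -> str:
    """Replace each known Kanji with its Simplified form; keep other characters."""
    return "".join(
        KANJI_INDEX[ch].simplified if ch in KANJI_INDEX else ch for ch in text
    )


if __name__ == "__main__":
    for row in TABLE:
        print(row.kanji, row.traditional, row.simplified, relation(row))
```

A character-by-character lookup like this ignores one-to-many mappings and context, which is part of why the relation between Kanji and Hanzi is described above as complicated.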
Similar resources
基於對照表以及語言模型之簡繁字體轉換 (Chinese Characters Conversion System based on Lookup Table and Language Model) [In Chinese]
The character sets used in China and Taiwan are both Chinese, but they are divided into simplified and traditional Chinese characters. A large amount of information is exchanged between China and Taiwan through books and the Internet. To provide readers with a convenient reading environment, character conversion between simplified and traditional Chinese is necessary. The conversion between simp...
The perception of simplified and traditional Chinese characters in the eye of simplified and traditional Chinese readers
Expertise in Chinese character recognition is marked by analytic/reduced holistic processing (Hsiao & Cottrell, 2009), which depends mainly on readers’ writing rather than reading experience (Tso, Au, & Hsiao, 2011). Here we examined whether simplified and traditional Chinese readers process characters differently in terms of holistic processing. When processing characters that are distinctive ...
Building a Collation Element Table for a Large Chinese Character Set in YES
YES is a simplified stroke-based method for sorting Chinese characters. It is free from stroke counting and grouping, and thus much faster and more accurate than the traditional method. This paper presents a collation element table built in YES for a large joint Chinese character set covering (a) all 20,902 characters of Unicode CJK Unified Ideographs, (b) all 11,408 characters in the Complete ...
PhD Curriculum For the Traditional and Complementary Medicine Program: A Qualitative Comparative Study in the Chinese and Iranian Medical Universities
Background: Recently, the traditional and complementary medicine has largely gained interest among scholars across the globe. Keeping pace with the on-going trend, the Iranian academia have provided the PhD course on Traditional Iranian Medicine. In this study, we attempted to perform a qualitative study, comparing the PhD curricula of the Chinese and Iranian traditional medicine in detail. Met...
An investigation Into Traditional Chinese Medicine Hospitals in China: Development Trend and Medical Service Innovation
Background: This paper aims to investigate the development trend of traditional Chinese medicine (TCM) hospitals in China and explore their medical service innovations, with special reference to the changing co-existence with western medicine (WM) at TCM hospitals. Methods: Quantitative data at macro level was collected from official databases of China Health Statistical Yearbook and Extracts o...
Publication date: 2012